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Adaptive  Control  of  Flat  MIMO  Nonlinear  Systems  With  Additive 

Disturbance 


Enver  Tatlicioglu,  Bin  Xian,  Darren  M.  Dawson,  and  Timothy  Burg 


Abstract:  In  this  paper,  two  controllers  are  developed 
for  flat  multi-input/multi-output  nonlinear  systems.  First,  a 
robust  adaptive  controller  is  proposed  and  proven  to  yield 
semi-global  asymptotic  tracking  in  the  presence  of  addi¬ 
tive  disturbances  and  parametric  uncertainty.  In  addition  to 
guaranteeing  an  asymptotic  output  tracking  result,  it  is  also 
proven  that  the  parameter  estimate  vector  is  driven  to  a 
constant  vector.  In  the  second  part  of  the  paper,  a  learning 
controller  is  designed  and  proven  to  yield  a  semi-global 
asymptotic  tracking  result  in  the  presence  of  additive  distur¬ 
bances  where  the  desired  trajectory  is  periodic.  A  continuous 
nonlinear  integral  feedback  component  is  utilized  in  the 
design  of  both  controllers  and  Lyapunov-based  techniques 
are  used  to  guarantee  that  the  tracking  error  is  asymptotically 
driven  to  zero.  Numerical  simulation  results  are  presented  for 
both  controllers. 

I.  Introduction 

Arguably,  an  interesting  control  problem  is  one  that  is  both 
challenging  from  a  theoretical  perspective  and  applicable 
to  real  systems  -  the  family  of  “flat”  nonlinear  systems 
appears  to  embody  both  of  these  properties.  A  flat  system 
is  characterized  by  a  dynamic  model  where  there  exists  a 
set  of  special  outputs  (equal  to  the  number  of  inputs)  such 
that  the  states  and  the  inputs  can  be  expressed  in  terms 
of  outputs  and  a  finite  number  of  its  derivatives  [14].  A 
surprising  number  of  practical  machines  match  this  form 
including  mobile  robots  and  cars,  cars  with  multiple  trailers, 
underwater  vehicles,  crane  systems,  induction  motors,  and 
planar  satellite/manipulator  systems  [4],  [7],  The  reader  is 
referred  to  [4]  and  [7]  for  a  more  detailed  explanation 
of  flatness  and  its  applications  to  physical  systems.  It  is 
the  case  of  multi-input  multi-output  (MIMO)  flat  systems 
with  parametric  uncertainty  and  bounded  disturbances  that 
is  considered  here.  Review  of  the  basic  control  problem 
suggests  and  disqualifies  certain  solutions.  It  is  probably  wise 
at  the  outset  to  discard  an  exact  model-based  control  ap- 
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proach  for  this  problem  given  that  any  parameter  estimation 
error  and  disturbances  are  not  directly  addressed,  and  hence, 
the  system  performance  and  stability  cannot  be  predicted 
a  priori.  Given  the  parametric  uncertainty  in  the  proposed 
class  of  systems  to  be  studied,  an  adaptive  control  solution 
may  be  warranted.  However,  an  adaptive  controller  designed 
for  a  disturbance  free  system  model  may  not  compensate 
for  the  disturbances  and  may  even  go  unstable  under  certain 
conditions.  Enhancing  the  adaptive  control  approach  with  a 
robust  component  to  form  a  robust  adaptive  controller  can 
generally  guarantee  closed-loop  signal  boundedness  in  the 
presence  of  the  additive  disturbances.  Unfortunately,  while 
a  robust  adaptive  controller  can  potentially  guarantee  the 
convergence  of  the  tracking  error  to  a  bounded  set  (i.e., 
the  tracking  error  can’t  necessarily  be  driven  to  zero)  the 
asymptotic  tracking  result  (where  the  tracking  error  is  driven 
to  zero)  that  would  be  shown  for  an  adaptive  controller 
applied  to  the  disturbance  free  model  will  be  lost.  These 
trade-offs  in  performance  and  robustness  have  framed  the 
last  ten  years  of  research  in  robust  adaptive  control. 

Review  of  relevant  work  highlights  some  of  the  different 
tacks  used  to  approach  this  problem.  An  adaptive  backstep- 
ping  controller  was  shown  by  Zhang  and  Ioannou  in  [18] 
for  a  class  of  single-input/single-output  (SISO)  linear  sys¬ 
tems  with  both  input  and  output  disturbances.  The  proposed 
controller  demonstrates  the  use  of  a  projection  algorithm  to 
bound  the  parameter  estimates  and  guarantees  an  ultimately 
bounded  tracking  error.  In  an  alternate  approach,  the  work  of 
Polycarpou  and  Ioannou  [16]  demonstrate  a  leakage-based 
adaptation  law  to  compensate  for  parametric  uncertainties. 
The  proposed  robust  adaptive  backstepping  controller  is 
applicable  to  a  class  of  higher-order  SISO  systems  with  un¬ 
known  nonlinearities.  The  suggested  control  law  guarantees 
global  uniform  ultimate  boundedness  of  the  system  state 
(with  some  restrictions  on  the  bounding  functions  of  the 
nonlinearities).  Robust  adaptive  control  laws  were  developed 
in  [5],  utilizing  the  modular  design  introduced  in  [12]  and  a 
tuning  function  design,  for  a  class  of  systems  similar  to  that 
studied  in  [16].  These  authors  show  estimates  on  the  effect 
of  the  bounded  uncertainties  and  external  disturbances  on  the 
tracking  error.  In  [9] ,  an  adaptive  backstepping  controller  for 
linear  systems  in  the  presence  of  output  and  multiplicative 
disturbances  is  designed.  Ikhouane  and  Krstic,  added  a 
switching  er-modification  to  the  tuning  functions  to  obtain 
a  tracking  error  proportional  to  the  size  of  the  perturbations. 
Marino  and  Tomei  [13]  proposed  a  robust  adaptive  tracking 
controller  that  achieves  boundedness  of  all  signals.  The  result 
is  based  on  a  class  of  SISO  nonlinear  systems  that  have  ad- 


ditive  disturbances  but  also  unknown  time-varying  bounded 
parameters.  It  is  significant  that  the  result  shows  arbitrary 
disturbance  attenuation.  In  [15],  Pan  and  Basar  proposed  a 
robust  adaptive  controller  for  a  similar  class  of  systems  in 
[13],  where  the  tracking  error  is  proven  to  be  C-2— bounded. 
In  [6],  Ge  and  Wang  proposed  a  robust  adaptive  controller 
for  SISO  nonlinear  systems  with  unknown  parameters  in  the 
presence  of  disturbances,  which  ensure  the  global  uniform 
boundedness  of  the  tracking  error. 

Most  of  the  research  in  adaptive  control  discussed  above 
has  focused  on  the  convergence  of  the  error  signals  and 
boundedness  of  the  closed-loop  system  signals.  As  the 
sophistication  in  adaptive  control  techniques  has  evolved, 
additional  questions  about  system  performance  have  arisen. 
Notably,  the  final  disposition  of  the  parameters  estimates 
in  the  closed-loop  system  has  been  examined.  It  is  well 
established  that  without  persistent  excitation  at  the  input, 
it  is  not  typically  possible  to  show  the  convergence  of 
the  parameter  estimates  to  the  corresponding  system  values 
(with  an  exception  being  a  least-squares  algorithm).  In  fact, 
for  gradient  and  Lyapunov-type  algorithms,  convergence  to 
a  constant  value,  is  typically  not  even  guaranteed.  Krstic 
summarizes  this  question  well  in  [11]  and  also  begins  to 
provide  some  answers.  In  [11],  it  is  shown  that  for  the 
proposed  adaptive  controller;  the  parameter  estimates  will 
reach  constant  values  after  a  sufficient  amount  of  time.  It  is 
shown  that  the  adaptation  mechanism  can  be  “turned  off” 
after  sufficient  time  and  that  the  learned  parameters  can  be 
used  in  a  non-adaptive  controller  of  the  same  structure  to 
stabilize  a  restart  of  the  system  from  new  initial  conditions. 
An  important  goal  of  the  present  work  is  to  include  a  state¬ 
ment  on  parameter  estimate  limits  for  a  controller  proposed 
for  the  flat  systems. 

A  recent  paper  by  Cai  et  al.  [2]  presented  a  robust  adaptive 
controller  for  MIMO  nonlinear  systems  with  parametric 
uncertainty  and  additive  disturbances.  With  some  restrictions 
placed  on  the  disturbances,  it  was  assumed  that  the  distur¬ 
bance  is  twice  continuously  differentiable  and  has  bounded 
time  derivatives  up  to  second  order,  the  proposed  controller 
was  proven  to  yield  an  asymptotic  output  tracking  result. 
However,  no  mention  of  the  convergence  of  the  parameter 
estimates  was  made.  Thinking  out  loud  for  a  moment,  it 
might  stand  to  reason  that  if  the  robust  part  of  the  controller 
is  compensating  for  the  disturbances  and  an  asymptotic 
tracking  result  is  obtained  then  perhaps  something  special  is 
happening  to  the  parameter  estimates.  Exploring  this  vague 
notion  with  mathematical  rigor,  we  will  show  that  with  a 
minor  modification  to  the  control  in  [2]  and  with  some 
additional  analysis  of  the  stability  result,  we  are  able  to 
formulate  a  new  conclusion  about  the  parameter  estimates. 
What  is  shown  is  that  this  robust  adaptive  controller  will 
yield  constant  parameter  estimates  even  in  the  presence  of 
the  disturbance.  The  stability  analysis  parallels  that  presented 
in  [2]  but  with  the  extended  analysis  the  convergence  of  the 
parameter  estimates  is  demonstrated.  The  main  contribution 
of  this  paper  is  to  add  to  the  small  number  of  results  where 
parameter  convergence  has  been  shown.  In  the  second  part 


of  the  paper,  a  learning  controller  for  the  same  class  of  flat 
systems  is  designed  under  the  assumption  that  the  reference 
trajectory  is  periodic  (for  past  research  related  to  the  design 
of  learning  controllers,  reader  is  referred  to  [1],  [3],  [8]  and 
the  references  therein).  This  controller  is  proven  to  yield 
a  semi-global  asymptotic  result  in  the  presence  of  additive 
disturbances.  In  the  design  of  both  controllers,  a  continuous 
nonlinear  integral  feedback  controller  (see  [17])  is  utilized 
and  Lyapunov-based  techniques  are  used  to  guarantee  that 
the  tracking  error  is  asymptotically  driven  to  zero.  Numer¬ 
ical  simulation  results  are  presented  for  both  controllers  to 
demonstrate  their  viability. 

II.  Adaptive  Control 
A.  Problem  Statement 

A  system  model  for  the  flat  nonlinear  systems  is  consid¬ 
ered  to  be  of  the  following  form 

*(")  =  f  +  G(u  +  d1)+d2  (1) 

where  x W  (t)  G  Rm,  i  =  0, ...,  (n  —  1),  are  the  system  states, 
f  €  Rm  and  G  (x,  x, ...,  x^n~1\  9)  G 

Kmxm  are  nonlinear  functions,  9  G  Rp  is  an  unknown 
constant  parameter  vector,  d\  (t)  ,  d2  (t)  G  Rm  are  unknown 
additive  nonlinear  disturbances,  and  u  ( t )  G  Rm  is  the  control 
input.  The  system  model  is  assumed  to  satisfy  the  following 
assumptions. 

Assumption  1:  The  nonlinear  function  G  (•)  is  symmetric, 
positive  definite  and  satisfies  the  following  inequalities 

222 1|£||2  <  £tM  (■)  £  <  to  (•)  ||£||2  V£  G  Rm  (2) 

where  M  {x,  x, ...,  a:*-”-1),  6)  G  Rmxm  is  defined  as 

M  =  G_1  (3) 

and  m  G  R  is  a  positive  bounding  constant, 
m  (x,  x, ...,  G  R  is  a  positive,  globally  invertible, 

nondecreasing  function  of  each  variable,  and  ||-||  denotes 
the  Euclidean  norm. 

Assumption  2:  The  nonlinear  functions,  /(•)  and  G(-), 
are  continuously  differentiable  up  to  their  second  derivatives 
(i.e.,  /(-),G(-)  GC2). 

Assumption  3:  The  nonlinear  functions,  /  (•)  and  M  (•), 
are  affine  in  9. 

Assumption  4:  The  additive  disturbances,  d\  ( t )  and 
d2  ( t ),  are  assumed  to  be  continuously  differentiable  and 
bounded  up  to  their  second  derivatives  (i.e.,  di  ( t )  G  C2  and 
di  (t) ,  di  ( t ) ,  di  (t)  G  Goo,  *  =  1,  2). 

The  output  tracking  error  ei  ( t )  G  Rm  is  defined  as  follows 

ei  =  xr  —  x  (4) 

where  xr  (t)  G  Rm  is  the  reference  trajectory  satisfying  the 
following  property 

xr  (t)  G  Cn  ,  4°  (t)  G  Goo  ,  *  =  0, 1, ...,  (n  +  2) .  (5) 

The  control  design  objective  is  to  develop  an  adaptive  control 
law  that  ensures  (t)  — >  0  as  t  — >  oo,  i  =  1 
and  that  all  signals  remain  bounded  within  the  closed-loop 


system.  To  achieve  the  control  objectives,  the  subsequent 
development  is  derived  based  on  the  assumption  that  the 
system  states  (t),  i  =  0, (n  —  1)  are  measurable. 

B.  Development  of  Robust  Adaptive  Control  Law 

The  filtered  tracking  error  signals,  ( t )  £  Rm,  i  = 

2,3,  ...,n  are  defined  as  follows 

e2  =  ei+  ei  (6a) 

e-3  =  e2+e2  +  ei  (6b) 


e-n  —  en_i  +  e„_i  +  en~2.  (6c) 

A  general  expression  for  e»,  i  =  2,3 in  terms  of  e\ 
and  its  time  derivatives  is  given  as  follows  [17] 


i- 1 


00 


ei  ~  °*Oel 

3=0 

where  the  constants  a,ij  are  defined  as  follows 


(7) 


a>i, o  =  B  (?)  = 


V5 


'l  +  V5N 


1  -  v/5N 


i  —  2,3 


a . 


i 

Q'i.i — 1 


=  k-j  +  1)  cik+j-ij-i 


k=  1 


3,4, ...,  n  ,  j  =  1,2,...,  (i  -  2) 
1  ,  i  =  1,2, ...,  n. 


(8) 

(9) 

(10) 


After  utilizing  (3),  the  system  model  can  be  rewritten  as 
follows 

Mz(n)  =  h  +  it  +  di  +  Md2 


where  h  (t)  £  lm  is  defined  as  follows 
h  =  Mf. 


(11) 


(12) 


To  facilitate  the  control  development,  the  filtered  tracking 
error  signal,  denoted  by  r  (t)  £  Rm,  is  defined  as  follows 

r  =  en  +  Aen  (13) 

where  A  £  Rmxm  is  a  constant,  diagonal,  positive  definite, 
gain  matrix.  After  differentiating  (13)  and  premultiplying  by 
M  (•),  the  following  expression  can  be  derived 

Mr  =  M  ^4n+1)  +  ^  anje[j+2)  +  Aen  j  (14) 
+Mx —  h  —  u—  d\  —  Md2  —  Md2 


where  (4),  (7)  and  the  first  time  derivative  of  (11)  were 
utilized.  The  dynamics  of  r  (t)  in  (14)  can  be  arranged  as 
follows 

Mr  =  ——Mr  —  en  —  ii  +  N  —  d\  —  Md2  —  Md2  (15) 


where  the  auxiliary  function  N  (x,  x, ...,  x^n\  f)  £  R.m  is 
defined  as  follows 

(71  —  2 

x(rn+i)  +  y 

+M  (x^  +  Tjf'j  +  e„  -  h. 


0U+ 2) 


+  Ae„ 


(16) 


To  facilitate  the  subsequent  analysis,  (15)  can  be  rearranged 
as  follows 

Mr  =  —  i.M) —  en— u  +  N  +  Nr+tl>  (17) 

where  N  (x,  x, ...,  x^n\  t)  ,  Nr  (t) ,  ip  (t)  £  are  defined 
as  follows 

N  =  (n  -  Md2-  Md2)  (18) 

—  —  Mrd2  —  Mrd^ 

Nr  ±  N\  .  .  (B)  („)  (19) 

'X=Xr,  X=Xr,...,  X^rL)  =Xr 

if  4  -d1-  Mrd2  -  Mrd2  (20) 


and  Mr  (t)  £  Rmxm  is  defined  as  follows 


Mr  =  Ml 


X=Xr,  X=Xr,.. 


c(n_  !)=xln 


(21) 


Remark  1:  By  utilizing  the  Mean  Value  Theorem  along 
with  Assumptions  2  and  4,  the  following  upper  bound  can 
be  developed 


Nf) 


<P  I 


(22) 


where  z  (t)  £  ]R(rl+1)mxl  is  defined  as  follows 


a  r  t 
=  [  ei 


T 

e2 


(23) 


and  p(-)  £  R>o  is  some  globally  invertible,  nondecreasing 
function. 

Remark  2:  After  utilizing  (5)  and  Assumption  4  along 
with  (20)  and  its  time  derivative,  then  it  is  clear  that  if  (t) , 

i>(t)  £  Coo. 

Remark  3:  After  utilizing  (5)  and  (16)  along  with  (19)  and 
its  time  derivative,  then  it  is  clear  that  Nr  (t) ,  Nr  (t)  £  Coo- 
Remark  4:  In  view  of  Assumption  3,  Nr  (•)  defined  in 
(19),  can  be  linearly  parameterized  in  the  sense  that 


Nr  =  wre 


(24) 


where  Wr  (t)  £  ]RmxP  is  the  known  regressor  matrix  and  is 
a  function  of  only  xr  (t)  and  its  time  derivatives. 

Based  on  (17)  and  (24),  the  control  input  is  designed  as 
follows 


u  =  (K  +  Im)  e„  (t)  -  (. K  +  Im)  en  (t0) 

+  [  (K  +  Im)  Aera  (r)  +  Wr  (t)  8  (r) 

Jt0  l 

+  (Cl  +  C2)  Sgn  (en  (r))]  dr  (25) 

where  8  (t)  £  W  is  generated  via 

8  =  T  f  (t)  Aen  (r)  dr -T  f  WJ  (r)  en  (r)  dr 

Jto  J  to 

+TWJ  {t)  en  (t)  -  TWj  (t0)  en  (to) 


(26) 


(36) 


with  K,  C\ ,  C2  £  lmxm  and  T  £  Rpxp  being  constant, 
diagonal,  positive  definite,  gain  matrices,  Im  £  Rmxm  being 
the  standard  identity  matrix,  and  Sgn(-)  being  the  vector 
signum  function  defined  as  follows 

Sgn(0  =  [  sgn(^)  sgn(£2)  ...  sgn(£m)  ]T 

VC  =  [  £l  C2  -  tm  ]T  •  (27) 

It  should  be  noted  that  0  (to)  =  0pxr  and  u(to)  =  0mxi 
where  0p><i  £  Rp  and  0mxi  £  Rm  are  vectors  of  zeros. 
Based  on  the  structure  of  (25)  and  (26),  the  following  are 
obtained 

ii  4  (K  +  Im)  r  +  (Ci  +  C2)  Sgn  (e„)  +  Wr9  (28) 

6  =  TWjr.  (29) 

Finally,  after  substituting  (28)  into  (17),  the  following  closed- 
loop  error  system  for  r  (t)  is  obtained 

Mr  =  Mr-en-(K  +  Im)r  +  Wr6  (30) 

—  (Cr  +  C2)  Sgn  (en)  +  N  +  if> 

where  the  parameter  estimation  error  signal  9  (t)  £  R.p  is 
defined  as  follows 

0  =  9-0.  (3i) 


C.  Stability  Analysis 

Theorem  1:  The  control  law  (25)  and  the  update  law  (26) 
ensure  the  boundedness  of  all  closed-loop  system  signals  and 
(t)  II  — >  0  as  t  — >  00,  i  =  0, n,  provided 


Amin  (A)  > 


1 

2’ 


(32) 


Cu>  Hi(t)\\Cao  +  —  ipi(t) 


(33) 


where  the  subscript  i  =  1  denotes  the  ?th  element 

of  the  vector  or  diagonal  matrix  and  the  elements  of  K 
are  selected  sufficiently  large  relative  to  the  system  initial 
conditions. 

Proof:  See  Appendix  I. 

Theorem  2:  There  exists  a  constant  vector  9a c  £  Rp  such 
that 

lim  9(f)  =0oo.  (34) 

t— >00 

Proof:  See  Appendix  II. 


dr 

d2 


cos  (2t)  +  exp  (— 0.5f ) 
sin  (3 1)  +  exp  (— 0.5t)  ’ 

sin  (2 1)  +  exp  (— 0.5f) 
cos  (3t)  +  exp  (— 0.5f) 


where  x  =  [  x±  X2  ] T.  The  nonlinear  disturbances  defined 
in  (36),  are  chosen  to  show  the  validity  of  the  proposed  con¬ 
troller  for  nonrepeating  disturbances.  The  reference  trajectory 
was  selected  as 


sin  t  ^1  —  exp 
2  sin  t  (l  —  exp 


(37) 


The  initial  conditions  of  the  system  were  set  to  x  (to)  = 
[  0.1  0.2  ]T  and  9  (to)  =  [  0  0  ]T,  while  the  controller 
parameters  were  chosen  as  A  =  /2,  K  =  20 12,  C\  =  10/2, 
C2  =  5 12,  and  T  =  20 /2  where  I2  £  K.2x2  is  the  standard 
identity  matrix.  In  Figures1  1  and  2,  the  state  x  (t)  and  the 
reference  trajectory  xr  (t)  are  presented,  respectively.  The 
tracking  error  e\  (t)  is  presented  in  Figure  3.  From  Figure  3, 
it  is  clear  that  the  tracking  objective  is  satisfied.  In  Figures  4 
and  5,  the  parameter  estimate  9  (t)  and  the  control  input  u  (t) 
are  presented,  respectively.  From  Figure  4,  it  is  clear  that  the 
parameter  estimate  vector  is  driven  to  a  constant  vector.  In 
Figures  6  and  7,  the  additive  disturbances  d\  (t)  and  d2  (t) 
are  presented,  respectively. 


III.  Learning  Control 
A.  Problem  Statement 

A  system  model  for  the  flat  nonlinear  systems  is  consid¬ 
ered  to  be  of  the  following  form 

®(")  =  /  +  G  (u  +  ch)  +  d2  (38) 

where  x W  (t)  £  Rm,  i  =  0, ...,  (n  —  1)  are  the  system  states, 
f(x,x,...,x(n~1'>)  £  Rm  and  G(x,x,  ...,x^n~1'>)  £  Rmxm 
are  nonlinear  functions,  d\  (t) ,  d2  (t)  £  Rm  are  unknown 
additive  disturbances,  and  u  (t)  £  Rm  is  the  control  input. 
The  system  model  is  assumed  to  satisfy  Assumptions  1.  2, 
and  4. 

The  output  tracking  error  ei  (t)  is  defined  in  (4)  and  in  this 
case  the  reference  trajectory  satisfies  the  following  property 


D.  Numerical  Simulation  Results 


A  numerical  simulation  was  performed  to  demonstrate  the 
performance  of  the  adaptive  controller  given  in  (25)  and  (26). 
A  first-order  flat  system  with  following  modelling  functions 
is  utilized  [2] 


/ 

9 


XiX2 

x% 


2  +  coscci  n 

,  G  = 

n  3  +  sm  x2 

L  92  \ 

(35) 


x(p  (t  +  T)  =  x^  (t)  ,  (t)  £  Coo 

i  =  0, 1, ...,  (n  +  2)  (39) 

where  T  £  R+  is  the  period  of  the  reference  trajectory. 

The  control  objective  is  to  develop  a  nonlinear  control 
that  ensures  \\ei  (f)||  — >  0  as  t  — >  00.  To  achieve  the 
control  objective,  the  subsequent  development  is  derived 
based  on  the  assumption  that  the  system  states  x^  (t), 
i  =  0, ...,  (n  —  1)  are  measurable. 

1  The  results  of  this  simulation  section  are  presented  in  Appendix  IV 


B.  Development  of  Learning  Control  Law 

The  open-loop  error  system  development  for  the  learning 
control  law  is  exactly  the  same  as  the  open-loop  error  system 
development  for  the  adaptive  control  law.  The  control  design 
is  assumed  to  continue  after  Remark  1  of  Section  II-B. 

Remark  5:  After  utilizing  (39)  and  Assumption  4  along 
with  (20)  and  its  time  derivative,  then  it  is  clear  that  ip  (t) , 
ip(t)  £  Coo- 

Remark  6:  After  utilizing  (16)  and  (39)  along  with  (19) 
and  its  time  derivative,  then  it  is  clear  that  Nr  ( t ) ,  Nr  (t)  £ 

OO  • 

Remark  7:  After  utilizing  (39),  it  is  clear  that  Nr  (t) 
satisfies  the  following  equation 

Nr  (■ t  +  T)  =  Nr  (t) .  (40) 

Based  on  (17),  the  control  input  is  designed  as  follows 

■u  ( t )  =  (K  +  Im)  en  ( t )  -  (K  +  Im)  en  (t0) 

+  f  [(K  +  Im)  Aen  (r)  +  CiSgn  (e„  (r ) )]  dr 
Jto 

+  Wr  (t)  (41) 

where  K,  C\ ,  A  £  K.mxm  are  constant,  diagonal,  posi¬ 
tive  definite,  gain  matrices,  Sgn(-)  is  defined  in  (27),  and 
Wr  ( t )  €  is  defined  as  follows 

Wr(t)  =  Wr(t-T)  +  kL  A  [  en  (t)  dr  (42) 

Jto 

+kLen  (t)  -  kLen  (t0) 

where  /c^  £  1  is  a  positive  gain.  It  should  be  noted  that 
since  Wr(to)  =  0mxi  it  follows  that  u  (to)  =  0mxi.  The 
auxiliary  function  Nr  (t)  £  ffi.1™  is  defined  as 

Nr  =Wr  .  (43) 

By  utilizing  (43)  along  with  (42),  the  following  can  be 
obtained 

Nr  (; t )  =  Nr  (t  -  T)  +  kLr  (t)  -  (44) 

Taking  the  time  derivative  of  (41)  along  (42)  and  (43) 
generates 

u  =  (K  +  Im)  r  +  CiSgn  (en)  +  Nr  (t) .  (45) 

Finally,  after  substituting  (45)  into  (17),  the  closed-loop  error 
system  for  r  ( t )  is  obtained  as  follows 

Mr  =  ~^Mr  -  en  -  {K  +  Im)r  (46) 
—CiSgn  (en)  +  N  +  Nr  +  if) 
where  Nr  ( t )  £  K.m  is  defined  as  follows 

Nr  =  Nr  —  Nr.  (47) 

By  utilizing  (40)  and  (44),  Nr  (t)  can  be  rewritten  as  follows 
Nr  (t)  =Nr(t-T)~  kLr.  (48) 


C.  Stability  Analysis 

Theorem  3:  The  control  law  (41)  and  (42)  ensures  that 
|| ei  (t)||  —>  0  as  t  — >  oo,  provided  that  (32)  and  (33)  are 
satisfied  and  the  elements  of  K  are  selected  sufficiently  large 
relative  to  the  system  initial  conditions. 

Proof:  See  Appendix  111. 

D.  Numerical  Simulation  Results 

A  numerical  simulation  was  performed  to  demonstrate  the 
performance  of  the  learning  controller  given  in  (41)  and 
(42).  The  flat  system  model  in  (35),  (36)  with  the  following 
reference  trajectory  is  utilized 


Xrl 

sin  (7 rf) 

Xr2 

cos  (7 rf) 

The  initial  conditions  of  the  system  were  set  to  x  (to)  = 
[  0.1  0.2  ]  ,  while  the  controller  parameters  were  chosen 

as  A  =  20 12,  K  =  20 12,  Cj  =  10 12,  and  kL  =  1.  In 
Figures2  8  and  9,  the  state  x  ( t )  and  the  reference  trajectory 
xr  (t)  are  presented,  respectively.  The  tracking  error  ei(£) 
is  presented  in  Figure  10.  From  Figure  10,  it  is  clear  that 
the  tracking  objective  is  satisfied.  In  Figures  11,  12,  13,  and 
14,  the  control  input  u  (t),  Wr  (t),  the  additive  disturbances 
di  ( t )  and  d,2  (t)  are  presented,  respectively. 

IV.  Conclusion 

Two  controllers  were  developed  for  flat  MIMO  nonlinear 
systems  in  the  presence  of  additive  disturbances.  The  ro¬ 
bust  adaptive  controller  was  proven  to  yield  a  semi-global 
asymptotic  tracking  result  in  the  presence  of  parametric 
uncertainty  along  with  additive  disturbances.  The  adaptive 
controller  and  the  adaptation  law  were  designed  such  that, 
the  parameter  estimate  vector  is  proven  to  go  to  a  con¬ 
stant  vector.  In  the  second  part  of  the  paper,  the  learning 
controller  was  proven  to  yield  a  semi-global  asymptotic 
result  in  the  presence  of  additive  disturbances  and  when  the 
desired  trajectory  is  periodic.  In  the  development  of  both 
controllers,  the  bounded  additive  disturbances  were  assumed 
to  be  twice  continuously  differentiable  and  have  bounded 
time  derivatives  up  to  second  order.  Since  no  assumptions 
were  made  regarding  the  periodicity  of  the  disturbances,  it 
is  clear  that  the  suggested  controllers  compensated  for  both 
repeating  and  nonrepeating  disturbances.  For  each  controller, 
Lyapunov-based  techniques  were  used  to  guarantee  that  the 
tracking  error  is  asymptotically  driven  to  zero.  Numerical 
simulation  results  were  presented  for  both  controllers  where 
nonrepeating  disurbances  were  utilized. 
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Appendix  I 
Proof  of  Theorem  1 

Lemma  1:  Let  the  auxiliary  functions  L\  ( t ),  L2  ( t )  £  R 
be  defined  as  follows 

Lx  =  rT  (xp-  CiSgn  (e„))  ,  L2  =  -elC2 Sgn  (e„) .  (50) 


If  Ci  is  selected  to  satisfy  the  sufficient  condition  (33),  then 


It0  Ll  (T) dT  -  Cm  ’  It0  L2  (T) dT  <  Cb2  (51) 

where  fbl,  (b2  €  R  are  positive  constants  defined  as 

m 

<61  -  Cli  \eni  I  -  en  (*0)  ll>  (t0) 

i=  1 
m 

C  b2  =  ^C'2l|e„i(<0)|.  (52) 

i—  1 

Proof:  After  substituting  (13)  into  (50)  and  then  inte¬ 
grating  L 1  (t)  in  time,  results  in  the  following  expression 


f  Lx(T)dT  =  j  e£(r)  AT  [xp  (r) 

J  to  J  to 

r))]  dr 
xp  (r)  dr 


(53) 


'to 

-CiSgn  (e„  (r))]  dr 

f  del  (' r) , 


dr 


'to 


/  to 


del ( T ) 

dr 


CiSgn  (en  (r))  dr . 


After  integrating  the  second  integral  on  the  right-hand  side 
of  (53)  by  parts,  the  following  expression  is  obtained 


[  Lx  (r)  dr  =  f  el  (t)  At  [xp  (r) 

J  to  J  to 


-C'iSgn  (en  (r))]  dr  +  el  (r)  xp  (r)  | 
-  /V(r)^A Lr 


I  to 


dr 


^  ^  i  |  P-ni  (7 


i= 1 
rt 


[  ^  (r)  AT  [V>  (r) 

Jto 


-A  lC— -  CiSgn  (e„  (r)) 
dr 

+en  (t)ip(t)  ~  el  (t0)xp(t0) 


dr 


-Sf2cu(\eni(t)\  -  |e„j(f°)|).  (54) 

i=  1 

The  right-hand  side  of  (54)  can  be  upper-bounded  as  follows 

ft  pt  ™> 

/  L\  (t)  dr  <  /  |eni  (r)|  Ai  [IV’j  (r)|  (55) 

Jt  n  Jtn 


i—X 


1 

~T. 


dip,  (r) 


dr 


-Cv 


dr 


+  Y  le™  (91  (I i’i  (91  -  Co)  +  C&1- 


If  C\  is  chosen  according  to  satisfy  (33),  then  the  first  on  (64)  can  be  obtained  as  follows 
inequality  in  (51)  can  be  proven  from  (55).  The  second  •  .  ..  ,.2  ,,  „  ,,  , 

inequality  in  (51)  can  be  obtained  by  integrating  L2(t)  ~  3  ~  “Hlrl!MI!~ll)l!~l 

defined  in  (50)  as  follows  \  .  /mil  i|2_' 


[  L2(t)  dr  =  -  [  e^(T)C'2Sgn(en(r))dr  (56) 

Jtn  dtn 


—  Cb2  —  le?li  —  ^b2- 


-Amin  (K)\\r\\2  -Y,^iC2i\eni  (t) | 


<  —  A3  — 


p2m ) 

4Amin  (K) 


^  '  ^iC2i  | &ni  (f ) 


The  following  is  the  proof  of  Theorem  1 . 

Proof:  Let  the  auxiliary  functions  Pi  (t),  P2  (f)  £  R  be 
defined  as  follows 

Pi  =  Cbi~  [  Li(r)dT  (57) 

J  to 

P2  ~  Cb2  f  L2(r)dT  (58) 

J  to 

where  L\  ( t ),  L2  (f) ,  Cbi  and  Cb2  were  defined  in  Lemma 
1 .  The  proof  of  Lemma  1  ensures  that  P\  (t)  and  P2  (t)  are 
non-negative.  The  non-negative  function  V  (s  (t) ,  t)  £  R  is 
defined  as  follows 

I  _  n  I  1  T 

V  =  -  ^  ef  e»  +  2rTMr  +  pi+p2+2e  T~ld  (59) 

i— 1 

where  s  (t)  £  ]g4(”+1)m+2+p]xi  is  defined  as  follows 

s=[zT  JTi  JP~2  f  Y  ■  (60) 

After  utilizing  (2),  (59)  can  be  bounded  as  follows 

W1(s)<V(s,t)<W2(s)  (61) 

where  W\  ( s ),  W2  (s)  £  R  are  defined  as  follows 

Wi  (s)  =  Xi  ||s||2  ,  W2  (s)  =  A2  (||s||)  ||s||2  (62) 

and3  Ai,  X2  (•)  £  R  are  defined  as  follows 

Ai  =  ^min{l,m,  Amin  (r-1)}  , 

A2  =  maxjl,  i?ri(||s||) ,  ^Amax  (r_1)|  .  (63) 

By  differentiating  (59),  the  following  expression  can  be 
obtained 

n—  1 

V  =  eTntien  +  e^’_1en  -  rTr  (64) 

i= 1 

+rT  N  -  rTKr  -  e^AC^Sgn  (en) 

where  (6a)-(6c),  (13),  (29),  (30)  and  (50)  were  utilized.  By 
using  (22),  (32),  and  the  triangle  inequality,  an  upper-bound 

3Using  (4)  and  (6a)-(6c)  it  can  be  shown  that  ||  (x,  x, ...,  x^n~ O)  ||  < 
t?(||s||)  where  $(•)  is  some  positive  function.  Thus,  fh(x,  x, ...,  < 

m(||4). 


where  A3  =  min  {A,  Amin  (A)  —  A}.  The  following  inequal¬ 
ity  can  be  developed 

m 

V  <W  (s)  —  ^2,h-iC2i  \eni  (i)|  (66) 

i—1 

where  W  (s)  £  R  denotes  the  following  non-positive  func¬ 
tion 

W(s)±-/30\\z\\2  (67) 

with  (30  £  R  being  a  positive  constant,  and  provided  that 
Amin  (K)  is  selected  according  to  the  following  sufficient 
condition 


Based  on  (59)-(63)  and  (65)-(67)  the  regions  D  and  S  can 
be  defined  as  follows 


Note  that  the  region  of  attraction  in  (70)  can  be  made  ar¬ 
bitrarily  large  to  include  any  initial  conditions  by  increasing 
Amin  (K)  (i.e.,  a  semi-global  stability  result).  Specifically, 
(62)  and  (70)  can  be  used  to  calculate  the  region  of  attraction 
as  follows 


which  can  be  rearranged  as 


By  utilizing  (23),  (52)  and  (60)  the  following  explicit  ex¬ 
pression  for  || s  (<o)||  can  be  derived  as  follows 

n 

l|s  (^o)ll2  =  5Zlle*(io)||2  +  lk(io)||2  (73) 

i=  1 

+Cbl  +  Cb2  +  ll^ll  • 


(77) 


From  (59),  (66),  (70)-(72),  it  is  clear  that  V  ( s,t )  G  Coo 
Vs  (to)  €  5;  hence  s  (t) ,  z  (i) ,  9  (t)  €  Coo  Vs  (to)  G  5. 
From  (66)  it  is  easy  to  prove  that  en  (t)  £  C\  Vs  (to)  G  5. 
From  (13),  it  is  clear  that  en  (t)  £  Coo  Vs  (to)  €  5.  By 
using  (4),  (5)  and  (7),  it  can  be  proved  that  (£)  g  LA, 
t  =  0,  Vs  (to)  €  5.  Then,  it  is  clear  that  M  (t) , 

M  (t) ,  /  (t)  G  £oo  Vs  (to)  €  5.  The  facts  that  r  (t) , 
0  (t)  €  £oo  Vs  (to)  G  5  can  be  used  along  with  (31)  and 


After  utilizing  (2),  (74)  can  be  bounded  as  follows 
Wi  (s)  <  V  (s,  t)  <  W2  (s) 
where  (s),  W2  (s)  £  R  are  defined  as  follows 

Wi  (s)  =  A,  ||s||2  ,  W2  (s)  =  A2  (||s||)  ||s||2  (78) 

and  Ai,  A2  (•)  £  R  are  defined  as  follows 


(29)  to  prove  that  9  (t) ,  9  (t)  £  Coo  Vs  (to)  G  S.  After 
using  these  boundedness  statements  along  with  (11)  and  (28), 
it  is  clear  that  u  (t) ,  u  (t)  £  Coo  Vs  (to)  G  S.  The  previous 
boundedness  statements  and  Remarks  1,  2,  3  can  be  used 
along  with  (17),  to  prove  that  r  (t)  £  Coo  Vs  (to)  G  S.  These 
boundedness  statements  can  be  used  along  with  the  time 
derivative  of  (67)  to  prove  that  W  (s  (t))  £  Coo  Vs  (to)  G  S; 
hence  W(s(t))  is  uniformly  continuous.  Standard  signal 
chasing  algorithms  can  be  used  to  prove  that  all  remaining 
signals  are  bounded.  A  direct  application  of  Theorem  8.4 
in  [10]  can  be  used  to  prove  that  ||z(i)||  — >  0  as  t  — »  00 
Vs  (to)  G  S.  Based  on  the  definition  of  z  (t),  it  is  easy  to 
show  that  He*  (<)|| ,  ||r  (<)||  — >  0  as  t  — >  00  Vs  (to)  G  S, 
i  =  1,2,  From  (13),  it  is  clear  that  ||en(t)||  — >  0  as 
t  — »  00  Vs  (to)  G  S.  By  utilizing  (7)  recursively  it  can 
be  proven  that  e'-^  (t)  — >  0  as  t  — >  00,  i  =  1,2 ,  ...,n 

Vs  (t0)  G  S.  m 

Appendix  II 
Proof  of  Theorem  2 

Proof:  The  fact  that  Wr  (t)  is  a  function  of  only  xr  (t) 
and  its  time  derivatives,  can  be  used  along  with  the  bound¬ 
edness  expression  in  (5),  to  show  that  Wr  (t) ,  Wr  (t)  g  Coo- 
After  considering  the  fact  that  en  (t)  G  C\  (see  the  proof  of 
Theorem  1),  it  is  clear  that  Wj  (t)  Ae„  (t) ,  Wj  (t)  en  (t)  G 
A.  This  assures  the  existence  of  the  limits  for  the  first  and 
second  terms  in  (26),  i.e.,  lim^oo  f  Wj  (r)  Aen  (r)  dr 
and  Hindoo  Wj  (r)  en  (r)  dr  exist  (see  Theorem  3.1 
of  [11]).  Based  on  the  fact  that  en  (t)  — »  0  as  t  — >  00 
Vs  (<o)  G  5  (see  the  proof  of  Theorem  1)  then  it  is  clear  that 
lim^oo  Wj  (t)  en  (t)  =  0.  Utilizing  the  above  facts  along 
with  the  fact  that  Wj  (to)  en  (to)  is  constant,  it  follows  that 
1  i  1 1 1  / — ,00  9  (t)  =  Boo-  ® 

Appendix  III 
Proof  of  Theorem  3 

Proof:  Let  V  (s,  t)  £  R  denotes  the  following  non¬ 
negative  function 

1  "  1 

F=-^efe,  +  -rTMr  +  P1  +  Vg  (74) 

Z  i=l 

where  Pi  (t)  was  defined  in  Lemma  1  and  Vg  (t)  £  R  is  a 
non-negative  function  defined  as  follows 

vg±^LfTN?(T)Nr(T)dT  (75) 

where  s  (t )  is  defined  as  follows 

s=[^T  VPi  VV9]T-  (76) 


Ai  —  -  min  {1,  to}  ,  A2  =  max  |l,  -m  (||s||)| .  (79) 

After  taking  the  time  derivative  of  (74),  the  following  ex¬ 
pression  can  be  obtained 


V  =  -^efei-el  Ae. 


n  T  'Cn—i  c-n 


kr 

rp  m  rji  f\j  /  .  rT1 

— r  r  +  r  N  —  r  Kr - r  r 


where  (6a)-(6c),  (13),  (46),  (48)  and  (50)  were  utilized.  By 
(22),  (32)  and  the  triangle  inequality,  an  upper-bound  on  (80) 
can  be  obtained  as  follows 

V  <  -A3||z||2  +  ||r||p(||z||)||z|| 
-(xmin(K)  +  ^-)  ||r||2 


/  (1  P2(IMD  \  „  „2 

-  "lA4“iA-wJNI  (81) 

where  A3  =  ruin  {A,  Ar„in  (A)  —  and  A4  = 
min  {A3,  4p}.  The  following  inequality  can  be  developed 

V  <  W  (s)  <  W  (s)  (82) 

where  W  (s) ,  W  (s)  G  R  denote  the  following  non-positive 
functions 

W(s)±-f30\\z\\2  ,  W  (s)  =  — /30  He, ||2  (83) 

with  /30  G  R  being  a  positive  constant,  and  provided  that 
Amin  (K)  is  selected  according  to  the  following  sufficient 
condition 

Amin  (K)  >  p2  (^ll}  (84) 

4A4 

or  ||z||  <  p_1  (2y/Xi Amin  (K)^j  . 

Based  on  (74)-(79)  and  (8 1  )-(83),  the  regions  D  and  S  can 
be  defined  as  follows 

V  =  |s  :  ||s||  <p-x  (2v/A4Ami„(A^}  (85) 

5  =  { s£V :  (86) 

W2(s)  <  Ai  (V1  (2N/A4Amin  (A")))2]  • 


V  = 


Note  that  the  region  of  attraction  in  (86)  can  be  made  ar¬ 
bitrarily  large  to  include  any  initial  conditions  by  increasing 
Amin  (K)  (i.e.,  a  semi-global  stability  result).  Specifically, 


(78)  and  (86)  can  be  used  to  calculate  the  region  of  attraction 
as  follows 


W2(s(t0))  <  Ai  (V1  (2x/A4Amin  (/V)))  (87) 

=►  \\s{to)\\<  '  ^ 


,  A2(||s(t0)||) 
p-1  (2n/A4A  mi„(iV)) 


which  can  be  rearranged  as 


Amin  {K)  -  4AlP 


1  „2  A2(||S(f0)||) 


Ai 


|s(*o)||  •  (88) 


By  utilizing  (23),  (52)  and  (76)  the  following  explicit  ex¬ 
pression  for  ||s  (io)||  can  be  derived  as  follows 

n 

Ik  (*o)||2  =  He,  (to)||2  +  Hr  (t0)||2  -h  Cbi •  (89) 

i= 1 


From  (74),  (82),  (86)-(88),  it  is  clear  that  V  ( s,t )  £  Coo 
Vs  (t0)  £  S;  hence  s  (t)  ,z  (t)  £  Coo  Vs  (to)  £  S.  From  (13), 
it  is  clear  that  en  (t)  £  Coo  Vs  (to)  £  S.  Using  (4)  and  (39),  it 
can  be  proved  that  (f)  £  Coo,  i  =  0, 1, n.  Vs  (to)  £  S. 
Then,  it  is  clear  that  M  (f) ,  M  (t) ,  f  (t)  £  Coo  Vs  (to)  £  S. 
By  using  these  boundedness  statements  along  with  (11)  it 
is  clear  that  u  (t)  £  Coo  Vs  (to)  £  S.  These  boundedness 
statements  can  be  used  along  with  the  time  derivative  of 

(83)  to  prove  that  W  (s(t))  £  Coo  Vs  (to)  £  S;  hence 
W  (s(t))  is  uniformly  continuous.  A  direct  application  of 
Theorem  8.4  in  [10]  can  be  used  to  prove  that  ||ei  (t)||-0 
as  t  — >  oo  Vs  (to)  £  S.  It  should  be  noted  that  for  finite  time 
the  subsequent  analysis  can  be  easily  extended  to  prove  that 
Nr  (t),  u  (t),  f  (t),  Nr  (t)  are  bounded.  ■ 

Remark  8:  It  should  be  noted  that  when  WT  (t)  is  de¬ 
signed  as  follows 


Wr  (t)  =  Sat ,3  (fVr  (r  -  T)J  +  kLAen  (i 

+kLen  (t)  -  (t0) 


dr 

(90) 


where  iVr  (t)  was  introduced  in  (43)  and  Sat/3  (•)  £  Km  is  a 
saturation  function  vector,  then  the  previous  analysis  can  be 
modified  to  prove  that  Nr  (t),  u  (t),  r  (t),  Nr  (t)  are  bounded 


for  all  time  and  thus 
l,...,n. 


Si) 


(t) 


converge  to  zero  for  i  = 


Appendix  IV 

Numerical  Simulation  Results 
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Fig.  1 .  (Adaptive  Controller)  State  x  (t) 


Xr1 


Xr2 


Fig.  2.  (Adaptive  Controller)  Reference  Trajectory  xr  (t) 
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Fig.  3.  (Adaptive  Controller)  Tracking  Error  e\  (t) 


Fig.  4.  (Adaptive  Controller)  Parameter  Estimate  0  (t) 
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Fig.  6.  (Adaptive  Controller)  Additive  Disturbance  d\  (t) 


Fig.  7.  (Adaptive  Controller)  Additive  Disturbance  cfe  (t) 


xi 


Fig.  5.  (Adaptive  Controller)  Control  Input  u  ( t ) 


Fig.  8.  (Learning  Controller)  State  x  ( t ) 


Xl2 


Fig.  9.  (Learning  Controller)  Reference  Trajectory  xr  (t) 


Fig.  12.  (Learning  Controller)  Wr  (£) 


Fig.  10.  (Learning  Controller)  Tracking  Error  ei(t)  Fi§'  13'  <Leaming  Controller)  Additive  Disturbance  di  (t) 


Fig.  11.  (Learning  Controller)  Control  Input  u(t)  Fig-  14.  (Learning  Controller)  Additive  Disturbance  d 2  ( t ) 


